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ABSTRACT 

A self-replicating system where the elements belonging to a solution category can 
replicate themselves by copying their own informations, is considered. The informa- 
tion carried by each element is defined by an element of all the n multiple tensor 
product of a base space that consists of m different base elements. We assume that 
in the replication the processes of copying each base information are the same and 
independent from one another and that the copying error distribution in each process 
is characterized by a small variation with a quite small mean value. Concentrating 
on the number fluctuation of the informations in the copying process, we analyze 
the time evolution of the system. We illustrate the change of averaged number of 
informations carried by system objects and the variation of the number distribution 
as a function of time. Especially, it is shown that the averaged number of information 
grows in general after large number of generations. 



UOSTP-99-005 



*email address: dsbak@mach.scu.ac.kr 



1 Introduction 



One of the essential features of the life system is that most of its element do possess 
an ability to replicate themselves by copying their own informations. The copying 
mechanisms in each element are mostly the same but its effectiveness is not perfect 
due to chemical and quantum fluctuations. If it were perfect, there should not be any 
change of the species of the system in its time evolution. The random fluctuation 
involved in this copying process is, hence, another important ingredient of the system 
in understanding its time evolution. 

The natural consequences of the replication with random fluctuation are of great 
interest since they may explain main characteristics of our life system. Admitting the 
complication in treating the real life system, one may introduce a simplified version 
of the system that, however, comprises the essential features of our life system on the 
aspect of the replication and the distribution of the information numbers. 

In this note, we shall consider the following simplified version. The system consists 
of objects that carry a characteristic information defined as an element belonging to 
all the n multiple tensor products of a base space for all the nonnegative integer n. 
We shall then define the information number of an object by N when it belongs to 
N-multiple tensor product space. The elements in the base space are fixed and their 
total number, m, is finite. In the real life system, for example, these base elements 
correspond to the four base acids. Namely, they are adenine, guanine, cytosine, and 
thymine. The information ensemble space is the place where the dynamics of interest 
occurs. A solution (moduli) space is defined as a subset of the information ensemble 
space with a restriction to the objects carrying the information element that may 
replicate themselves. We assume that the replication mechanism of a base element 
is mostly the same and the unit processes copying base information are independent 
events. Due to the chemical and quantum mechanical fluctuations, there might be 
copying errors that maps one specific element of the base space to an element in the 
ensemble space. Of course, the errors are quite small but still to be a main source 
of the dynamical flow of the system after generations. The errors include the change 
of the original base element into another base element, omission of the original base 
element and mapping the base element into more than a base element. The probability 
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of mapping the base element to more than two base units is quite small compared to 
the total probability of errors and, hence, the mean of the fluctuation is very close to 
the case of no errors. The mean of the errors is more or less inclined to the direction of 
growing of average numbers after one copy event because of the asymmetry reflected 
in the fact that there are no counterparts of mapping a base element to more than 
two base elements about the symmetry point of the mapping to just one element. 

The unit time between subsequent generations will be set all the same and time 
flow will be measured by the number of generations. This implies that all the objects 
in the system at certain time have exactly the same numbers of ancestors born after an 
initial time. The next generation will be defined by the set of two identical descendents 
from each object of the original system which belongs to the solution category. The 
solution space is determined as a function of environment and the system itself. The 
properties of the solution space are not known mostly for realistic life systems. We 
shall introduce a minimal assumption that the density of solutions over the total 
ensemble space can be defined as a smooth function. This means that the solutions 
are densely distributed over the whole information configuration space. 

Basic features of the time evolution of the system are as follows. Because of the 
copying errors, the consequence of all the replications of the objects of the original 
generation belonging to the solution space will be the next generation set whose el- 
ements may or may not belong to the solution space. After a unit time, the next 
generation again replicate its elements once they belong to the solution space. Re- 
peating this process from generations to generations, the generations will flow in the 
information configuration space quite as the time flies, but the flow will be restricted 
to regions of near solution space. The errors propagate and the initial set diffuses into 
the new arena in the configuration space. What are the properties of the dynamics 
of the system as the number of generations grows? 

In this note, we shall concentrate on the information number fluctuation of the 
system objects as a result of the replication. Especially, we will consider the averaged 
information number and their variation as a function of time. In this way, we will 
demonstrate that there is a finite probability of the appearance of objects carrying a 
large number of information compared to that of an initial ancestor. 
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2 Self replicating system and its evolution 



As mentioned in the introduction, the self-replicating system consists of objects that 
carry an information element belonging to the information configuration space. To 
define the configuration space, we first introduce a base space B with a finite number 
of elements: 

B = {b u b 2 ,---,b m }. (2.1) 

The information configuration space is then the direct sum of all n-multiple tensor 
product, 

oo 

E = ®B n , (2.2) 

n=0 

where B n denotes the n multiple tensor product of the base space B. For each element 
of the information configuration space, we specify its number of information by the 
order of the tensor product where it belongs. As a subset of the ensemble space, the 
solution space Sk is composed of the information elements, whose objects have an 
ability to replicate themselves from the k-th generation to the (k + l)-th generation. 
As remarked earlier, we will assume that the solution space elements are densely 
distributed over the information ensemble space, so that one may define the density 
of the solutions p(n, k) 

as a smooth function of n where N(A) is the number of elements in a set A and Z p is a 
normalization constant. The number fluctuation involved in the replication event of a 
base element takes the a distribution d(l + po, <j ) with a mean 1 + p and a standard 
deviation gq. If one denotes the probability of I base elements resulted from a unit 
base by pi for a copy, one may infer that po ~ p 2 = p {$>i — po <C p) , p% <C p and the 
contribution of all the higher mode may be ignored as explained earlier. By an explicit 
computation, one finds that <7q ~ 2p and po ~ 2p 3 +p 2 — po <7q. Here, the variation 
(Tq is much smaller than one, and we use an approximation that one is describing 
the distribution as if n is a continuous parameter. When one replicates an object 
carrying n information, the number fluctuation associated with this is described by 
the normal distribution 

5n = z(n(l + p ),y/na ) (2.4) 
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owing to the independence of all the replication events as well as the central limit 
theorem. The informations of the system is also given as a subset A k of the configu- 
ration space at the k-th generation. The distribution of the system informations over 
the ensemble space at the k-th generation is then described by 

<f)(n,k) = Z^N(A k HB n ), (2.5) 

where 0(n, 0) represents the initial distribution of the informations and denotes 
a normalization factor. With help of the distribution function in (|2.4j) , one finds 
the density distribution of the system in the next generation is determined by the 
diffusion process from the set A k fl S k at the k-th generation. Namely, all the element 
in A k fl Sk are doubled by replications and the number distribution of each descendent 
is described by (|2.4|). This is summarized in terms of the density description by 

POD 

cf)(n,k + l)= dn'G(n,n')p(n',k)(j){n',k). (2.6) 
Jo 

where G(n,n') is the propagator of the distribution 

G ( n ' n ) = 7k=, Ex P( ^-r~2 }• ( 2 - 7 ) 

In ( |2.6| ), the factor two by the doubling as a result of replication is absorbed into 
the normalization factor of the density function. By the mathematical induction, the 
density of the system information at arbitrary generation is obtained from the initial 
data by 

POO 

<j>{n,k)= I dn'P(n,n';k)(j)(n',0), (2.8) 



o 

where the propagator is defined as 



P(n,n';k) = \J[ J™ dmj I J[ G{n j+1 , nfipin^j) J , (2.9) 



with 71& = n and n = n'. The propergator in (|2.6| ) can be represented in a differential 
form for small fiQ and <Jq. To measure the smallness of the mean and the variation, 
we introduce new parameters fi and a by 

Ho = Ate, o"o = a ' 2t (2-10) 



such that the new parameter o 2 is 0(1). To obtain the differential form of the 
propagation, we define the infinitesimal time evolution by 

i/,(s,t + e)= dsG(s,s')M,t), (2.11) 
Jo 

where one measures the time by e multiplied by the number of generations k. Intro- 
ducing a variable £ by 

,-«i+ri = (y/ - et (2 . 12) 

cry s 

one may rewrite the above integral as 

= r^ 2 / 2 &^[(l- / i e )^+ S (26£+26 2 £ 2 -/i e )^+26 2 £ 2 S Vl+0( e 2 ) (2.13) 



with b = ay/e/\Jk{l + fie)s. Integrating over the ^-variable, one may obtain the 
following differential equation, 



a\ d 2 d, d 



i/>(s,t), (2.14) 



where the terms of 0(e 2 ) are ignored. Alternatively, the above equation may be 
presented by 

^(r 2 ,t) = -iJ(p,x)^(r 2 ,t) (2.15) 



with the Hamiltonian 

H(p, x) ee ^p 2 + • p + p = £(p + ^x) 2 + ^x 2 (2.16) 



where the definition r 2 = s = x\ + x\ + x\ + x\ and the four dimensional gradient 
V = ip are introduced. Although the Hamiltonian is not a Hermitian operator, the 
time evolution of the system is well defined. 

Inclusion of the density contribution to the time evolution is a complicated prob- 
lem. Some speculations on the dynamics with some generic density function will be 
relegated to the conclusion. Here, let us consider the case that it is possible to write 
the density function in a form 

p(s,t) =PoW(l -eU(s,t) +0(e 2 )) (2.17) 
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where the function U(s,t) satisfies the requirement 

1 f L 

e lim - / ds\U(s,t)\ < 1. (2.18) 

L-*oo L JO 

Note that the overall factor po w iH be absorbed into the normalization of the distribu- 
tion function <ft without changing the probability amplitude. Thus the time evolution 
of the system function <p(s, t) for this case can be easily identified as 

0(r 2 , t + e) = [1 - eH (p, x)] [1 - dJ{r\ t)]0(r 2 , t) + 0(e 2 ), (2.19) 



from the combination of (|2.6|) (|2.14j) and ( 2.17]) . Hence the differential equation 



describing the time evolution with the density function (|2.17 ) is 

J^(r 2 , t) = -[H(p, x) + U{r\ t)]0(r 2 , t). (2.20) 

As discussed in the introduction, the form of the potential U(s, t) may depend upon 
the system function 0(s, t) because the system itself works as an environmental el- 
ement. However, we won't further discuss this possibility in the following for the 
simplicity. 

3 Solution and statistics of self-replicating system 

In the preceding section, we discuss the time evolution process and the differential 
equation governing the system dynamics. For the case that one may characterize the 
system by the potential U for the solution density contribution, the time evolution 
of the system follows the differential equation ( |2.20| ). Let us first consider the case 
U — 0, which implies that the density function is constant in its argument. Namely, 
the solution space distribution is uniform over the information configuration space. 
To find the dynamics, it is convenient to construct the kernel K(r 2 ,r' 2 ;t) that is 
defined as a solution of the equation (|2.20|) with the initial condition 



o (r 2 ) = 5{r 2 -r' 2 ). (3.1) 

With help of the kernel, the solution for a general initial condition 0(r 2 , 0) is obtained 
with 

0(r 2 , t) = [°° d(r' 2 )K(r 2 , r' 2 ; t)0(r' 2 , 0). (3.2) 
Jo 
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In order to find the kernel, we first introduce the function <p(r 2 ,t) by 

^r 2 ,t) = e^^^r 2 ,t), (3.3) 
and insert this into the equation (|2.20|) . One finds that the equation for now reads 

^0(r 2 ,t) = -iJ(p,x)0(r 2 ,t) (3.4) 

with a new Hamiltonian 

#(p,x) = ^p 2 + ^x 2 . (3.5) 

It is interesting to note that this is the Hamiltonian that describes a simple harmonic 
oscillator in four dimensional flat Euclidean space. The kernel for the equation (|3.4j ) is 
simply a product of the propagator for one dimensional simple harmonic oscillator 0: 

* (X - X ' i() = (2^) 2Ex 4-^^l (x2 + X ' 2)c08h ^- 2X ' X 'l}' (3 ' 6 > 



with the solution of the differential equation ( |3.4| ) 



<7(x, t)= J d 4 x'K(x, x'; t)g(xf, 0). (3.7) 

for an arbitrary initial condition g(x, 0). The kernel for our system is then obtained by 
taking g(x', 0) = e _/ "*' /°" 2 5(r' 2 — r 2 ) in (|3.71 ) and multiplying the factor e Mr2//<j2 . Using 
the integral representation of the Bessel function 0, one finds that the expression for 
the kernel reads 



K(r 2 , r 2 ; t) = e^'" 2 J d 4 x'K(x, x'; t)e~^ 2 5{r' 2 - r 2 ) 

= o Ex P l^[( r2 " r o) - (r 2 + r 2 ) coth^]U( ^ r [° )■ (3-8) 

2rsinh/it H I a 2 lK 0J v 0J P J J V sinh v ; 



where I\{x) is the first-kind Bessel function of imaginary argument. For the consis- 
tency of the equation (|3.2|) , the kernel must satisfy the relation 

K(r 2 , r' 2 ; t + t')= f° d{z 2 )K(r 2 , z 2 ; t)K(z 2 , r' 2 ; if), (3.9) 
Jo 

which may obtained by applying the equation (|3.2|) twice for time intervals [0, t'] and 
[t',t + t'}. Upon using the expression ( |3.8|) , this composition rule may be checked 



explicitly by a straightforward computation with help of the formulas for the definite 
integrals involving the Bessel function |2|]. 

Following the standard rule, the mean M(t) and V(t) variation of the number of 
information are defined by 

/ °° dss(f)(s,t) 



M(t) ee 



V(t) 



JS°d8*<l>{8,t) w2 



M\t). 



(3.10) 
(3.11) 



fS°d a <t>(s,t) 

For the case of the initial condition 4>o(s) = 5(s — Sq), one may compute explicitly 
the mean and variation noting that <p(s,t) is given by K(s,So,t). For this initial 
condition, we first compute the total amplitude P(t); the resulting expression reads 

2//s e^ 



(3.12) 



/o ~ cr 2 sinh fit' 

As a function of t, /i and <r, the mean and variation for the initial condition are 
explicitly 



M(t) 
V(t) 



a^oP^e^fx- 1 sinh fit - s^Exp{4/xt - ^f.J 



(3.13) 
P~ 2 (t). (3.14) 



a 2 sinh fit ' 

By taking the limit that fi goes to zero from these expressions, one may obtain the 
mean and variation of the system with \i = and they are 

s 



M (t) 
V (t) 



e t 



s et 



let 



I) 5 



It , X 

y( e " - 1) - 1 



(3.15) 
(3.16) 



where A = 2so/a 2 . For t <C A, one finds that the mean and variation are 

M (t) ~ s , V (t) ~ 2s 2 t/X, 



(3.17) 



which agree with those computed from the initial condition <f)(s, 0) 
the other hand, for t ^> A, we have 



M (t) 



o 2 t 



V (t) 



' 2 + \ 2 



S(s — so). On 



(3.18) 



Hence the mean and variation are independent of the initial parameter so and grow 
in a power law of time as the time gets larger. With the nonvanishing fi, one finds 
that as far as \fd\ <C 1, the mean and the variation behave in the same way as the 
case with [i = 0. On the other hand, for the case fit ^> 1, the mean and variation are 
approximated by 

M (t) ~ j^~Y (3-19) 



V(t) 



2 2^(2t+\) 
b t. 



J_( e 2M _ !) _ ! 



(3.20) 



For the negative /i, the large time (i.e. t ^> — ) behaviors are simply M(t) ~ 
cr 2 / (4|/i|) and V(t) ~ a 4 / (4/x) 2 and hence only in this case the mean and variation of 
the system are bounded from above all the time. 

One may further consider the case with nonvanishing potential U(s, t). Of course, 
it is not possible to solve the system explicitly without an explicit form of the po- 
tential. However, the influence of the potential to the time evolution of the system 
is not so complicated to understand. Since the role of the potential U appears as a 
weighting factor e _eC7 ^ , *- ) on the amplitude in each propagating interval [t,t + e], the 
larger positive value of U(s, t) at a certain position results in the more suppression of 
the amplitude, while the larger negative value leads to the more amplification of the 
probability amplitude. Especially, for the case that the harmonic potential term in 
( |3.5|) dominates the potential U at large s, e.g. 

\U{s,t) \ < fi 2 s/2a 2 as s -> large, (3.21) 

the large time features of the system mostly agrees with those in ( p.!9|) and ( |3.20| ). 
For the case that U is much larger than Uq = fi 2 s/2a 2 , the growth of the mean and 
variation are relatively suppressed. The other limiting case, —U/Uo ^> 1, the mean 
and variation grow much faster in time compared to the case U = 0. 

What happens to the case that the influence of the density cannot be presented 
in terms of the potential description? In this case, the amplification by the density 
multiplication in each generation are 0(1) and become huge after a large number 
of generations are passed. The distribution function of the system will be sharply 
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peaked at the maximum of the density after a large number of generations. This is 
characterized by the mean M(t) — > Max(p(s, t)) with V(t) — > 0. Thus, in this case 
one may conclude that there is no evolution of the system at all after a large time as 
far as the probability distribution of the system is concerned. 

One may speculate that the real life system presides in between the two limiting 
cases of the 0(1) variation of the density function and U — 0. 

4 Outlook 

As a simplified version of the real life system, we have modeled the self-replicating 
system whose dynamics can be projected onto the space of information configurations. 
Each object in the system carries an information element in the configuration space. 
The unit copying process for the replication of the base information is characterized 
by the number fluctuation with the mean /i and the standard deviation <t , and 
each of unit copying is assumed to be an independent event. We define the solution 
space by the condition that its element has an enough information to replicate itself 
to the next generation. To characterize the properties of the solution space, we 
define the density function of the solution space over the information configuration 
space. We have shown that the dynamics of the system is described with help of the 
time evolution kernel and may be mapped into the evolution of the four dimensional 
harmonic oscillator for the case of uniform density of the solution space. In addition, 
we have discussed the system dynamics for the various case of nonuniform density. 
Especially, we have proved that the time evolution of the mean and variation of the 
information numbers over the system configuration are growing with time with a few 
exceptions. This exception compiles the case of the negative fi, which is improbable 
in the realistic system. 

In our model, some details of the real life system are not included for simplicity. 
For example, the unit time intervals between generations are not all the same but 
depend upon the generations and the information number variable in the real life 
system. The number of descendents from a system object after a generation is also 
dependent on the time and the number variable. Moreover, there are two kinds of 
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the replication processes, which are nothing but asexual and sexual reproductions. 
There may be also a probability to fail to produce something else rather than base 
elements in the unit copying process, though it is expected to be extremely small. 
As mentioned earlier, it may be the case that the density of the solution space is a 
functional of the system distribution <fi(s,t). In this case, the equation governing the 
time evolution inevitably becomes nonlinear in <p(s, t) and this nonlinear effect may 
be important in understanding the decelerating force of the population growth due 
to the limitation of resources. 

These kind of fluctuations may be important in understanding the local dynamical 
evolution at a certain local time interval [t,t + At]. However, if the fluctuations are 
averaged over the long time, the effects of these deviations may be effectively ignored 
without introducing any serious change in global pictures. Furthermore, the detailed 
description of these variations may be incorporated by the slight modifications of the 
model presented. 

Nevertheless, the effect of these variation may be crucial in comparing the theory 
to the characteristics of the real life system because observations on the real life 
system is confined to a present short time interval. In this sense, the experimental 
implication of our model need more investigations because how to extract essential 
features of the real system dynamics from the present data depends considerably on 
the detailed form of local fluctuations. 
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